Restauro-G: A Rapid Genome Re-Annotation System for Comparative Genomics
نویسندگان
چکیده
Annotations of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLAST-Like Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98% accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/under the GNU General Public License.
منابع مشابه
CoGenT++: an extensive and extensible data environment for computational genomics
MOTIVATION CoGenT++ is a data environment for computational research in comparative and functional genomics, designed to address issues of consistency, reproducibility, scalability and accessibility. DESCRIPTION CoGenT++ facilitates the re-distribution of all fully sequenced and published genomes, storing information about species, gene names and protein sequences. We describe our scalable im...
متن کاملInternational Summer School, ‘ From Genome to Life’
This report from the International Summer School 'From Genome to Life', held at the Institute d'Etudes Scientifiques de Cargèse in Corsica in July 2002, covers the talks of the invited speakers. The topics of the talks can be broadly grouped into the areas of genome annotation, comparative and evolutionary genomics, functional genomics, proteomics, structural genomics, pharmacogenomics, and org...
متن کاملComparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species
Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to D genome, which is an important wild bio-source for the cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information are a versatile tool for species identification and phylogenetic implications in plants. Different ch...
متن کاملReview of Techniques for Gene Sequencing, Annotation and Comparative Genomics
The availability and complete sequencing of many organisms has made comparative analysis of gene a new field of research. The explosion in sequenced genome data on daily basis made this task an enormous one. Several techniques and methods have been devised and applied to carry out genome comparison. In this work, we surveyed and presented an overview of common methods, techniques, tools and cha...
متن کاملProposal for Drosophila as a Model System for Comparative Genomics
The challenge of obtaining a complete annotation of functional genes and regulatory elements in the genomes of higher organisms remains a rate-limiting step to biological discovery. Recently, impressive progress has been reported based on comparative analysis of genome sequences of related species (Boffelli et al. 2003, Kellis et al. 2003). These studies underscore the pressing need to establis...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 5 شماره
صفحات -
تاریخ انتشار 2007